Wrappers for Feature Subset Selection
نویسندگان
چکیده
In the feature subset selection problem a learning algorithm is faced with the problem of selecting a relevant subset of features upon which to focus its attention while ignoring the rest To achieve the best possible performance with a particular learning algorithm on a particular training set a feature subset selection method should consider how the algorithm and the training set interact We explore the relation between optimal feature subset selection and relevance Our wrapper method searches for an optimal feature subset tailored to a particular algorithm and a domain We study the strengths and weaknesses of the wrapper approach and show a series of improved designs We compare the wrapper approach to induction without feature subset selection and to Relief a lter approach to feature subset selection Signi cant improvement in accuracy is achieved for some datasets for the two families of induction algorithms used decision trees and Naive Bayes
منابع مشابه
A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems
Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...
متن کاملA Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems
Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...
متن کاملWrapper Feature Selection
INTRODUCTION It is well known that the performance of most data mining algorithms can be deteriorated by features that do not add any value to learning tasks. Feature selection can be used to limit the effects of such features by seeking only the relevant subset from the original features (de Souza et al., 2006). This subset of the relevant features is discovered by removing those that are cons...
متن کاملTrace Ratio Criterion for Feature Selection
Fisher score and Laplacian score are two popular feature selection algorithms, both of which belong to the general graph-based feature selection framework. In this framework, a feature subset is selected based on the corresponding score (subset-level score), which is calculated in a trace ratio form. Since the number of all possible feature subsets is very huge, it is often prohibitively expens...
متن کاملA New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum
Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...
متن کاملA Hybrid Feature Selection Method based on IGSBFS and Na�e Bayes for the Diagnosis of Erythemato - Squamous Diseases
In this paper a diagnostic model based on Naive Bayes developed to diagnose erytemato squamous diseases. The hybrid feature selection method, named IGSBFS (Information Gain and Sequential Backward Floating Search), combines the advantages of filters and wrappers to select the optimal feature subset from the original feature set. In IGSBFS, Information Gain acts as filters to remove redundant fe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Artif. Intell.
دوره 97 شماره
صفحات -
تاریخ انتشار 1997